On local equilibrium equations for clustering states 



Giorgio Parisi 

Dipartimento di Fisica, Sezione INFN, SMC and UdRml of INFM, 
■ Universita di Roma "La Sapienza" , 

O ; Piazzale Aldo Moro 2, 1-00185 Rome (Italy) 

^ ■ February 1, 2008 

m ; Abstract 

In this note we show that local equilibrium equations (the generalization of the TAP equa- 
^ ■ tions or of the belief propagation equations) do have solutions in the colorable phase of 

I the coloring problem. The same results extend to other optimization problems where the 

^ ' solutions has cost zero (e.g. K-satisfiability) . On a random graph the solutions of the local 

CJ ■ equilibrium equations are associated to clusters of configurations (clustering states). On 

a random graph the local equilibrium equations have solutions almost everywhere in the 
CN I uncolored phase; in this case we have to introduce the concept quasi-solution of the local 

' equilibrium equations. 

o 

^ 1 Introduction 

O I Statistical mechanics of complex systems, in the limit where the number of degrees of freedom 
^ ■ goes to infinity, has been widely investigated in these recent years .Ij. A crucial role is played by 
^ ■ the local equilibrium states: often a system is declared complex only if the the number of local 
• ^ ',. equilibrium states is exponentially large. Intuitively we would say that there are many local 
^ ' equilibrium states if the configuration space breaks down into many unconnected (or connected 
d ^ through channels of low probability) clusters of configurations. The breaking of the configuration 
space into an exponential number of clusters corresponds to the breaking of the replica symmetry 
and it is one of the crucial phenomena described by the replica approach [H Ej . 

It should be possible to make a precise mathematical statement that corresponds to the 
intuitive notion of having an exponentially large number of states in the limit N going to infinity. 
Unfortunately a mathematical precise definition of a local equilibrium state for a given system 
with large N is not easy, especially when the number of a local equilibrium states is very large. 
A tentative definition of a local equilibrium state has been done using dynamical ideas jS]. One 
introduce a natural dependence of the configurations on the time and a local equilibrium state 
is a region where the system may remain for an exponentially large time. 

This definition may be adequate for some purposes, but it is not so natural as far as it can 
be done only after the introduction of a dynamics. Often one try to associate local equilibrium 
states to solutions of the local equilibrium equations. 

The local equilibrium equations are called in the literature under different names in different 
contexts: in statistical mechanics they are the TAP equations [3], in optimization problems 
the finite temperature TAP equations correspond to the 'Sum-Product" algorithm and the zero 



temperature limit of the TAP equations are the behef propagation equations that correspond to 
"Min-Sum" algorithm 0ini- 

One difficulty with the approach based on the local equilibrium equations is that at the 
present moment we do not have a clear idea if these equations do have a solution and, if solutions 
do exist, how are they related to local equilibrium states. 

The aim of this note is to get a better understanding of these matters; this will be done 
by presenting explicit solutions (or quasi solutions) of the local equilibrium equations at zero 
temperature. This will be done for problems defined on a random graph with a finite number 
of edges starting from each node. The situation in infinite range models, like the Sherrington 
Kirkpatrick model, is likely different. 

These ideas have been used to study problems defined on random graphs. The prototype of 
a random graph we will consider in this paper is a randomly chosen graph among all possible 
graphs with nodes and M = aN links. 

The same considerations can be done also for other problems, like the k-satisfiability, where 
similar computations have been done. 

2 Coloring a colorable graph 
2.1 Whitening a colored graph 

Let us consider a graph with nodes and M = aN links. Our task is to color it by assigning 
a color to each node in such a way that two adjacent nodes have different color. The number of 
colors is fixed and it is equal to q. 

The interesting situation is where the graph is colorable in many different way (i.e. much 
large of q\). If the graph is colorable, we are interested in knowing its entropy density S {NS is 
the total entropy) defined by 

^{coloring) = exp{NS). (1) 

For a colorable graph our aim is to associate to each coloring a solution of the local equilibrium 
equations, that will be defined later. We proceed in the following way. A coloring is a function 
c{i), where z is a node and c{i) can takes integer values in the interval [1 — q]- The coloring is 
legal if c(z) ^ c{j) for all nodes adjacent to i. 

Starting from a legal coloring we can introduce a whitening procedure. We examine each 
node. We have two possibilities: 

• The color of the node is blocked and it cannot be changed producing a legal coloring: this 
happens if the node has neighbours with exactly q — I different (non-white) colors. 

• The color of the node is not blocked and it can take at least two different legal colors: this 
happens if the node has neighbours with less than q — 1 different (non-white) colors. 

We assign to the node i the white color (conventionally 0) if we can produce a legal coloring by 
changing the color of the node i without changing the color of the adjacent nodes (i.e. in the 
second case). This procedure is repeated by considering the non-white adjacent nodes until no 
nodes can be whitened ^. 

In other words a coloring is legal if 

Sci^,,cU) =0 V J G A{i) , (2) 

^Similar ideas are common in the literature: sometimes white node are caUed spin fou.. In the approach of 
(3 El their cavity fields are zero. They play an important role and they are called jokers in [H]). 



where A{i) is the set of nodes adjacent to i. A whitening w{i) can takes integer values in the 
interval [0 — g] ; it is legal if 

w{i)w{j)5y,^i)^^(^j) = W j e A{i) . (3) 

A whitening is extremal if it cannot be whitened. In this way we associate to each coloring 
an extremal whitening. It is crucial that at each step the number of white nodes increases, so 
that the whitening procedure must stop somewhere. 

It is possible to prove that the extremal whitening associated to a coloring is unique, i.e. it 
does not depend on the order in which we perform the whitening. Extremal whitenings satisfy 
simple local equations: a node is white if the adjacent nodes have less than q — 1 different non- 
white colors; a node is of the appropriate color if the adjacent nodes have exactly g — 1 different 
non-white colors. 

It is clear that sometimes the whitening produces a configuration where all nodes are white 
while in other cases we end in configurations in which some nodes are colored. The whitening 
process is supposed to evidentiate those nodes that are essential in the coloring (i.e. the nodes 
that remain non- white); this extends the notion of core introduced in [10 . We expect that in 
many cases a very large number of different legal coloring ends in the same extremal whitening. 

There are many questions that can be posed. 

• If an extremal whitening is obtained by whitening a legal coloring, how difficult is the 
reconstruction of a legal coloring starting from the whitening? Intuitively is should be 
easier that starting from scratch, but it is not easy to quantify it. 

• Are there solutions to the local equilibrium equations for an extremal whitening that cannot 
be obtained by whitening a legal coloring? 

The rational for introducing the whitenings is to extract some common properties from a 
some wide class of coloring (it extends the notion of core introduced in [lOj. It is easy to argue 
that in the general case there is an exponentially large number of different legal coloring that 
produce the same extremal whitening. 

2.2 Directional coloring and directional whitening 

A directional coloring is a function c{i\k), where k G A{i), that takes values in the interval [1 — 
It is a legal directional coloring if 

Sc{i\k),c{j\m) = V j G A{i) and j ^ k \/ m e A{j) and m^i . (4) 

In some sense c{i\k) is a color that the node i could have in absence of the adjacent node k. 
If we set 

c{i\k) = c(i) (5) 

and c{i) is a legal coloring, c{i\k) is a legal directional coloring. In the same way a directional 
whitening is a similar function w{i\k) that takes values in the interval [0 — g]. It is a legal 
whitening if 

w{i\k)w{j\i)6w(i\k),w{j\m) = 0, V j e A{i) and j^k V m e A{j) and m^i. (6) 

We can now start from a legal coloring, we produce a legal directional coloring and we begin 
a whitening procedure where a,w{i\k) is set to zero if we can produce an other legal directional 



whitening by changing the color w{i\k) without changing the color of the adjacent nodes. The 
procedure goes on up to the point where we reach an extremal directional whitening. 

Also in this case an extremal directional whitening can be characterized by local equations: 
it a legal whitening and it cannot be whitened; w{i\k) is not white if and only if there are exactly 
q — 1 different non-white colors in the set formed by w{j\m) with j G A{i), m G A{j), j k 
and j 7^ m. The local equilibrium conditions for a directional whitening to be extremal are 
equivalent to the TAP equations and to the belief equations in this context. 

It is possible to prove that this construction is unique and there is one and only one extremal 
directional whitening associated to a given legal coloring. For such an extremal directional 
whitening, for a given i all non- white w{i\k) must have the same color. Also if this condition is 
satisfied it is not evident if there are of extremal directional whitening that cannot be obtained 
by whitening a legal coloring. 

Up to now we have only given definitions. The extremal whitening are useful because of the 
following three theorems. 

(A) If the graph is a tree and if we fix the value of the whitening on the leaves of the tree 
there is only one extremal whitening on the rest of the tree. 

(B) If we consider two different legal colorings and the nodes where the two colorings differ 
are contained in the interior of a tree (that is a subgraph of the original graph), the extremal 
directional whitening corresponding to these two coloring do coincide. The same results hold if 
the nodes where they differ are contained in the interior of some non-overlapping trees. 

Theorem (A) is a particular case of the well known theorem on the convergence of the belief 
algorithm on a tree |6i (an elementary proof for K-satisfiability can be found in [T^). 

The proof of the theorem (B) is trivial. We start the whitening procedure in the interior of 
the tree. According to theorem (A) the whitening procedure will converge to a result that does 
not depend on the initial conditions. The difference among the different coloring is thus wiped 
up. We can now whitening the rest of the graph and the results are obviously equal in both 
cases. 

If we consider a random graph and a node i of the graph. Let us call Q{N, a, L) the probability 
that two colorings that differ on some nodes, all of them at a distance less than L from the node 
i, produce two different extremal directional whitenings. We can prove the following theorem: 

(C) The probability Q{N, a, L) goes to zero when goes to infinity at fixed a. 
Theorem (C) is a trivial consequence of theorem (B) and of the fact that for a random graph, 

when goes to infinity at fixed a, the set of nodes at distance less than L from a given node 
belongs to a tree with probability 1. In this situation a random graph is locally a tree. 

Theorem (C) implies that when N goes to infinity in a random graph two colorings must 
be different in a large number of nodes in order to produce two different extremal directional 
whitenings. In other words, with probability one, colourings that differ only in a fixed amount 
of nodes do produce the same extremal directional whitening. 

The local equilibrium equations for an extremal directional whitening are the zero tempera- 
ture limit of the TAP equations for the coloring problem; it is believed that on a random graph 
we associate a different local equilibrium state to each different solution of such equations . 

3 Coloring uncolorable graphs 
3.1 Directional whitening does not work 

In the case where the graph if not colorable we can ask how to colour it in such a way to minimize 
the number of edges of different color. In this case we would like to find the minimum of the 



quantity 



H 



J2 ^cii),cU)) ■ 



(7) 



We will say that a given configuration is a local minimum of order k (or /c-stable) if the 
energy cannot decrease if we change the color in a set of nodes that are located at distance equal 
or less than k from a given node [3j. If k is large enough the local minimum coincides with the 
global minimum. 

We are interested to study the case of a random graph when goes to infinite at fixed a in 
the regime where 



We could firstly try to use the same approach of the previous section. 

The most naive approach would be to start from the actual coloring and to construct a 
directional whitening by choosing the color of the node i in absence of the edge from i to j by 
knowing the colors of the others nodes near to i. If more that one color is possible, we can set 
the color to white (in certain cases we should also remember the list of the allowed colors for a 
non- white node). 

However here in this process a node can go from a non-white color to an other non-white 
color (in one or two steps) and the iterative process may not converge fT^ ITT] . 

There is a very simple case that displays this behaviour. We consider a ring with nodes, 
being an odd number: the simplest case is = 3 and q = 2. The iterative equations are 



where the sums are done modulo A^. The function / is defined by /(I) =2 and /(2) = 1. 

It is evident that the previous equations have no solution and, if we iterate then, we find an 
oscillatory sequence that does not converge. A similar effect is present in the case of g = 3 and 
we have an clique of 4 points (a tetrahedron). 

Although we can formally write equations that correspond to extremal directional whitening 
(i.e. the local equilibrium equations), there are graphs such that these equations do not have 
solution and it is not clear what happens in the random case for large A^. Something different 
must be done. 

3.2 Quasi-solutions 

In spite of the fact that the local equilibrium equations may have no solution, we would like 
to prove that we can associate to a local minimum of order k a quasi-solution of the local 
equilibrium equations. As we shall see a quasi-solution is a function that satisfies the local 
equilibrium equations nearly everywhere (i.e. in most of the nodes, the fraction of nodes where 
the local equilibrium equations are not satisfied goes to zero when A^ goes to infinity). In the 
example of previous section is easy to find functions that satisfies equation 1^1 in all points by one 
(e.g. c(i, z -|- 1) = 1 for i even and c(i, z + 1) = 2 for z odd). 

The proof runs as follows. Let us consider a local minimum of order k (c*) and a node i. The 
quantity Ab(c; i) is the minimum of H — H* when the color at i is given by c{i) and the minimum 
is done over those configurations that differs from a* only in nodes at a distance from i less than 
b with b < k. In the previous notation a node is white if there are al least two different colors 
such that Afe(c; i) = 0. 

The quantity Ab(c; i) is a bounded decreasing function of b so that the dependance on b is 
small when b is large. In the case of bicoloring 



1 << A; << A^ . 





(9) 



Ab{c;i) = hb{i){c{i) -c*{i)) , 



(10) 



where the quantity hb{i) defined by the previous equation is the effective cavity field. 

In a similar way we can define the quantity Ab(ci, C2, C3; zi, 22, ^3): it is the minimum of 
configurations that differs from a* only in nodes at a distance from ii or 22 or i^ less than b with 
b < k. 

In the case of a random graph we have that 

Af,(ci, C2, C3, ii, Z2, ^3) = Afe(ci; ii) + Afe(c2; 22) + Ab(c3; 23) , (11) 
with a probability R{N, a, b) such that 

lim R{N,a,b) = 1 . (12) 

A'' -+00 

The proof is trivial. When goes to infinity at fixed b and a the set of nodes Nk at distance 
less than b from ik has no intersection with probabihty 1, and the function Afc(ci, C2, C3; ii, ^2, ^3) 
decomposes into the sum of three independent contributions. 

We now construct the local equilibrium equations for this problem. For each site we define 
the function Ah{c;i,j), that is defined in the same way as the the function Ah{c;i), with the 
only difference that it is computed for the problem where the edge {i,j) is removed. 

It is evident that Ab(c;z,j) is a function of all Afe_i(c;/c,m) (for k G A{i) with k ^ j and 
m G A{j) with m ^ i) with probability 1 (i.e. in all the cases the appropriate factorization 
equation holds). The explicit expression is rather lengthy and it will be not written here |HI. We 
can write it 

A,(c;z,j) = i^({A,-i(A:,m)}) (13) 

In this way we compute the the quantities A with a value of b using those with b — 1. However 
in most of the cases for large b the functions Aft_i(c; i,j) and Ab{c; i,j) are the same, so that in 
the region 1 << b « N the quantities Ab{c{k,i)) satisfies with probability 1 the equations 

Ab{c-z,j) = F{{Ab{k,m)}) (14) 

These equations are the local equilibrium equations of this problem. The probability that 
the factorization equation (fTTj) is not true at a given node, goes to zero with N goes to infinity, 
however the number of nodes where it is not satisfied may remains constant or go to infinity. In 
this case we are not able to prove that the local equilibrium equations do have a solution, but 
they have a quasi solution, i.e. there are functions A(c; i, k) that satisfy the local equilibrium 
equations 

A{t,j) = F{{A{k,t)}) (15) 

in most of the sites. More precisely 

E,,gACo|A(z,j)-F({A(A:,z)})| 

N ^ ' 

should be smaller that an appropriate function of that goes to zero when goes to infinity. 

In this way we can explicitly associate a quasi-solution of the local equilibrium equations to a 
fc-stable configuration. Also in this case we expect that many k stable configurations (an whole 
equilibrium state) do correspond to the same solution of the TAP equations. 



4 Counting the whitening 



The existence of more than one extremal directional whitening corresponds to breaking the phase 
space into different clusters. This is a very interesting phenomenon. We can conjecture that 



the there is an one to one correspondence among extremal directional whitenings and clusters 
of solutions (pure states or lumps). A natural question is to compute the number of extremal 
directional whitenings that are produced starting by a legal coloring. 

On a random graph this can be done analytically using the so called survey propagation 
equations |H1 El El CSl E| • For reader convenience let us report the main results of JT^. For 
a < ad there is only one whitening, i. e. the trivial one: all white. For ad < a < ac there is 
an exponentially large number of solutions, that disappear at ac, where the graphs are no more 
colorable. In the region ad < a < ac the total number of extremal directional whitenings is 
given by the relation: 

^^{extremal directional whitening) = exp(A^S(a)) . (17) 

The complexity S(a) has been computed. It start from a non-zero value at ad, it decreases 
by increasing a and becomes zero at ac- For a > ac directional extremal whitenings disappear 
and a generic graph is no more colorable ^3 Ej ■ 

The same techniques we have applied to the local equilibrium equations can be used to study 
survey propagation equations. The aim of these equations is to characterize the set of all the 
extremal directional whitenings. Let us define the survey fj{i): the g + 1 components of this 
vector are the fraction r]c{i) of extremal whitenings (obtained by whitening a legal coloring) that 
have the color c at the point i. For symmetry reasons we must have that 

Vc{i) = vi"^) for c = 1, g 

r]o{i) = 1 - qrjii) . (18) 

Let us denote by ?7ci,c2(^i5 "^2) the fraction of extremal directional whitenings that are obtained 
by whitening a legal coloring that have the color ci at the point ii and the color C2 at the point 
i2- We can hope that (in analogy with equation [TT|) the following factorization property holds: 

Vci,c2{h,i2) = Vci{ii)Vc2{ii) ■ (19) 

Such an equation cannot be always exact: it is false if ii and 22 are nearby and it is valid only 
with probability 1 when goes to infinity. 

Let us define the survey fj{i,j)] the q + 1 components of this vector are the fraction ric{i,j) 
of extremal directional whitenings (obtained by whitening a legal coloring) that have the color 
c at the point i when the edge from i to j is removed.. 

Using the same argument used in the derivation of the local equilibrium equations, one can 
show that 

ff{z,j) = GiMk,z)}) , (20) 

where the function G has an explicit form, computed in ^1]. The previous equations are the 
survey propagation equations. 

The function G is defined as following: let us call Aj[i) the set of adjacent nodes of i that 
are different from j and let as call d its cardinality. We extract d — 1 colors, each according to 
the probability f]{k,m) {k G Aj{i)) that are interpreted as colors of a directional whitening. 

We call Zj{i) the probability that the number of different non- white colors is less or equal to 
q — 1. The quantity Zj{i) is the probability that we can construct a legal directional whitening 
starting from a whitening that does not contains the node i and adding the edges from the node 
i to the nodes in Aj{i). The quantity rjc are the probabilities (divided by Zj{i)) that there are 
exactly g — 1 different non-white colors and that the missing color is c. 



Using the solution of the previous equations, that is supposed to be unique [15^ on a random 
graph for large N, we can estimate the complexity density [121 CHI, using 



(21) 



where Z{i,j) is the probability that, when we extract a color in i with probability fKijj) and we 
extract a color in j with probability f]{j,i), the two colors are different. In other words Z{i,j) 
is the probability to be able to produce a legal directional whitening if we add the edge from i 
to j to a graph that does not contains it. In the same way Z{i) is the probability to be able 
to produce a legal whitening if we add the node z to a graph that does not contains it and is is 
given by 



and it does not depend on the choice of of j G A{i). 

The arguments leading to the previous equations will be not reported here and they can be 
found in the original papers. 

5 Conclusions 

In the colorable phase for coloring (or in the satisfiable phase for K-satisfiability) we have shown 
that one can associate solutions of local equilibrium equations to extreme directional whitenings 
that are generated starting from legal configurations. We also conjecture that a cluster of 
solutions correspond to one extreme directional whitening. If the legal configurations cluster 
in many different states, the existence of many extremal directional whitenings is a signal of 
the existence of many states. If we identify, as supported by these considerations, extremal 
directional whitenings with local equilibrium states, the existing predictions on the number 
(and on the other properties) of local equilibrium states, [121 UlI can be tested by studying the 
extremal directional whitening. The construction and the counting of the extremal directional 
whitening is numerically an easy job, so that the prediction of this note and of the whole replica 
based approach 1^ can be tested numerically without difficulties. 

An other important point is the proof that the local equilibrium equations (e.g. TAP equa- 
tions or belief equations) in the uncolorable phase must be satisfied nearly everywhere, but no 
argument can be done that implies that the equations should be satisfied everywhere: quasi 
solution must exist but bona fide solutions may be missing. 
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